Efficient Cycle-Accurate Simulation of the Ultrasparc III CPU
نویسندگان
چکیده
This paper presents a novel technique for cycleaccurate simulation of the Central Processing Unit (CPU) of a modern superscalar processor, the UltraSPARC III Cu processor. The technique is based on adding a module to an existing fetch-decode-execute style of CPU simulator, rather than the traditional method of fully modelling the CPU microarchitecture. It is also suitable for accurate SMP modelling. The main functions of the module are the simulation of instruction grouping, register interlocks and the store buffer. Its simple table-driven implementation permits easy modification for exploring microarchitectural variations. The technique results in a 40% loss of simulation speed, instead of a 10 times or greater performance loss by fully implementing the detailed micro-architecture. The technique is validated against an actual UltraSPARC III Cu processor, and achieves high levels of accuracy over a range of scientific benchmarks.
منابع مشابه
UltraSPARC-II/: expanding the boundaries of a system on a chip
he central mission of the UltraSPARC-IIi is optimized price/performance and ease of use for the system designer. Stated differently, the CPU must deliver a lot of performance for the least impact on overall system cost and also enable simplified system design. An important strategy in enabling low-cost SPARC-based systems is to leverage the PC industry economy of scale by using the industry-sta...
متن کاملEvaluation of CPU architecture by simulation technologies and benchmark computer systems
The processor architecture designers face major challenges to improve the processor’s performance. To measure the performance of the processor there are many parameter like performance of cache, TLB, IO operations, bus speed etc. Different companies launch series of processors with same base configuration and a little change of variations base on cache size, cache levels, share and separate cac...
متن کاملModelling Time in Full System Simulation
Simulation of computer systems is becoming increasingly important in areas such as code optimization, performance evaluation and processor and memory system design. An important trade-off in simulator design is performance vs. abstraction of the model. Simics is a full system simulator, capable of simulating an entire computer system, including the processor, TLB, MMU, memory hierarchy and surr...
متن کاملTable 2: Cpu times on an Ultrasparc 2 C Omputer, Number of Oating Point Operations And
This paper introduces a fast and high performance subsampling method using feedforward neural networks. Experimental results using still images are evaluated subjectively and objectively. The results show that our method outperforms, in all cases taken into account, the standard lowpass ltering and subsampling method. Our method's main advantages are high speed, good quality of the nal images w...
متن کاملData Access Characteristics and Optimizations for Sun UltraSPARC T2 and T2+ Systems
Processor and system architectures that feature multiple memory controllers and/or ccNUMA characteristics are prone to show bottlenecks and erratic performance numbers on scientific codes. Although cache thrashing, aliasing conflicts, and ccNUMA locality and contention problems are well known for many types of systems, they take on peculiar forms on the new Sun UltraSPARC T2 and T2+ processors,...
متن کامل